Speech Modulation Features for Robust Nonnative Speech Accent Detection

Authors

  • Sam Sethserey
  • Xiong Xiao
  • Laurent Besacier
  • Eric Castelli
  • Haizhou Li
  • Chng Eng Siong
Abstract

In this paper, we propose using speech modulation features for robust nonnative accent detection. The modulation spectrum carries long-term temporal information about speech and may discriminate between the accents of native and nonnative speakers. For each speech segment to be tested, we extract a 10-dimensional feature vector from the modulation spectrum and use it for model training and testing. The proposed modulation features are compared with other popular features, such as pitch and formants, on a nonnative French accent detection task. Results show that the modulation features produce good detection performance and are quite robust to channel distortions. In addition, when the test scores of the modulation features and the pitch features are combined, the detection error is further reduced significantly. The best equal error rate, 13.1%, is obtained by fusing the pitch-based and modulation-based systems.
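The abstract does not say how the 10-dimensional vector is computed, so the following is only a minimal sketch of one plausible reading: take the short-time log-energy envelope of a segment, compute the magnitude spectrum of that envelope (a simple modulation spectrum), and average it into 10 equal-width modulation-frequency bands. The function name `modulation_features`, the 25 ms / 10 ms framing, the 0 to 20 Hz range, and the band averaging are all assumptions made for illustration and are not taken from the paper. Only the Python standard library is used so that the sketch stays self-contained.

```python
import cmath
import math


def modulation_features(signal, sample_rate, n_features=10,
                        frame_len=0.025, frame_shift=0.010, max_mod_hz=20.0):
    """Illustrative modulation-spectrum features for one speech segment.

    All parameter defaults are assumptions, not values from the paper.
    Returns a list of n_features non-negative values that sum to 1.
    """
    win = int(round(frame_len * sample_rate))
    hop = int(round(frame_shift * sample_rate))
    n_frames = 1 + (len(signal) - win) // hop
    if n_frames < 2:
        raise ValueError("segment too short for modulation analysis")

    # Short-time log energy: one value per frame (the temporal envelope).
    log_energy = []
    for t in range(n_frames):
        frame = signal[t * hop:t * hop + win]
        log_energy.append(math.log(sum(x * x for x in frame) + 1e-10))

    # Remove the mean and apply a Hann taper before the transform.
    mean = sum(log_energy) / n_frames
    envelope = [
        (e - mean) * (0.5 - 0.5 * math.cos(2 * math.pi * t / (n_frames - 1)))
        for t, e in enumerate(log_energy)
    ]

    # DFT magnitude of the envelope, accumulated into equal-width bands
    # between 0 Hz and max_mod_hz (hypothetical band layout).
    band_width = max_mod_hz / n_features
    sums = [0.0] * n_features
    counts = [0] * n_features
    for k in range(n_frames // 2 + 1):
        freq = k / (n_frames * frame_shift)  # modulation frequency in Hz
        band = int(freq // band_width)
        if band >= n_features:
            break
        mag = abs(sum(v * cmath.exp(-2j * math.pi * k * t / n_frames)
                      for t, v in enumerate(envelope)))
        sums[band] += mag
        counts[band] += 1

    feats = [s / c if c else 0.0 for s, c in zip(sums, counts)]
    total = sum(feats)
    return [f / total for f in feats] if total > 0 else feats
```

Under these assumptions, a segment whose energy fluctuates at roughly syllable rate (a few hertz) concentrates its feature mass in the low modulation bands. Such vectors could then be fed to any classifier, and the resulting scores combined with those of a pitch-based system as the abstract describes.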

Similar resources

Perception of Nonnative Accent: A Cross-Sectional Perspective Pilot Survey

Accent bias is a consequence of ethnocentrism. No studies have examined accent bias across educational levels in the U.S., much less across students and professionals in speech language pathology (SLP), a field that requires multicultural sensitivity training. This study examines nonnative accent perception among three groups—high schoolers, SLP students, and SLP professionals. One-hundred-and-...

An Information-Theoretic Discussion of Convolutional Bottleneck Features for Robust Speech Recognition

Convolutional Neural Networks (CNNs) have shown good performance in speech recognition systems, both for feature extraction and for acoustic modeling. In addition, CNNs have been used for robust speech recognition, and competitive results have been reported. A Convolutive Bottleneck Network (CBN) is a kind of CNN that has a bottleneck layer among its fully connected layers. The bottleneck fea...

Monologic vs. Dialogic Assessment of Speech Act Performance: Role of Nonnative L2 Teachers’ Professional Experience on Their Rating Criteria

Few, if any, studies have investigated the effect of professional experience as a rater variable and type of assessment as a task variable on raters’ criteria in the assessment of speech acts. This study aimed to explore the impact of nonnative teachers’ professional experience on the use of criteria in monologic and dialogic assessment of 12 role-plays of 3 apology speech acts. To this end, 60...

Perceptual Dimensions of Nonnative Speech

Foreign-accented speech has most commonly been characterized along three related but independent dimensions: intelligibility, comprehensibility, and accent [6]. The present study applied an auditory free classification task, which has been used to test listeners’ perceptual representations of regional dialects [5] and different languages [3], to further investigate the salient perceptual dime...

Investigating Pitch Accent Recognition in Non-native Speech

Acquisition of prosody, in addition to vocabulary and grammar, is essential for language learners. However, it has received less attention in instruction. To enable automatic identification and feedback on learners’ prosodic errors, we investigate automatic pitch accent labeling for nonnative speech. We demonstrate that an acoustic-based context model can achieve accuracies over 79% on binary p...

Publication date: 2011